GBE 



Strong Mutational Bias Toward Deletions in the Drosophila 
melanogaster Genome Is Compensated by Selection 

Evgeny V. Leushkin 1,2 '*, Georgii A. Bazykin 1,2 , and Alexey S. Kondrashov 1,3 

department of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia 
institute for Information Transmission Problems of the Russian Academy of Sciences, Moscow, Russia 
3 Life Sciences Institute and Department of Ecology and Evolutionary Biology, University of Michigan 
^Corresponding author: E-mail: leushkin@gmail.com. 
Accepted: February 5, 2013 

Abstract 

Insertions and deletions (collectively indels) obviously have a major impact on genome evolution. However, before large-scale data 
on indel polymorphism became available, it was difficult to estimate the strength of selection acting on indel mutations. Here, we 
analyze indel polymorphism and divergence in different compartments of the Drosophila melanogaster genome: exons, introns of 
different lengths, and intergenic regions. Data on low-frequency polymorphisms indicate that 0.036-0.039 short (1-30 nt) insertion 
mutations and 0.085-0.092 short deletion mutations, with mean lengths 3.23 and 4.78, respectively, occur per single-nucleotide 
substitution. The excess of short deletion over short insertion mutations implies that indel mutations of these lengths should lead to a 
loss of approximately 0.30 nt per single-nucleotide replacement. However, polymorphism and divergence data show that this 
deletion bias is almost completely compensated by selection: Negative selection is stronger against deletions, whereas insertions 
are more likely to be favored by positive selection. Among the inframe low-frequency polymorphic mutations in exons, long introns, 
and intergenic regions, selection prevents a larger fraction of deletions (80-87%, depending on the type of the compartment) than of 
insertions (70-82%) or single-nucleotide substitutions (49-73%), from reaching high frequencies. The corresponding fractions were 
the lowest in short introns: 66%, 47%, and 15%, respectively, consistent with the weakest selective constraint in them. The 
McDonald-Kreitman test shows that 32-46% of the deletions and 60-73% of the insertions that were fixed in the recent evolution 
of D. melanogaster are adaptive, whereas this fraction is only 0-29% for single-nucleotide substitutions. 
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Introduction 

The genome of Drosophila melanogaster is under pervasive 
selection. On the one hand, among the de novo nucleotide 
substitutions (single-nucleotide changing mutations), approx- 
imately 90% of nonsynonymous mutations (Eyre-Walker and 
Keightley 2009), and >50% of mutations in intergenic re- 
gions and long introns (Andolfatto 2005; Halligan and 
Keightley 2006; Casillas et al. 2007), are under negative se- 
lection strong enough to radically reduce their probability of 
fixation. On the other hand, a substantial number of the re- 
placements (i.e., mutations fixed between species) are driven 
by positive selection, although the estimates of the fraction of 
adaptive replacements are still discordant (Andolfatto 2005; 
Eyre-Walker and Keightley 2009; Parsch et al. 2010; Mackay 
et al. 2012). Only short introns and, to a lesser extent, synon- 
ymous sites appear to be mostly free of strong selective 



constraint (Halligan and Keightley 2006; Haddrill et al. 2007; 
Parsch et al. 2010; Clemente and Vogl 2012). 

The other common class of mutations, insertions and de- 
letions (indels), are also affected by selection. In short introns 
of D. melanogaster, deletions are disfavored by selection be- 
cause there is a lower limit on the intron length (Andolfatto 
2005; Presgraves 2006; Wang and Yu 201 1). In long introns 
and intergenic regions of D. melanogaster, polymorphic inser- 
tions were shown to have higher frequencies than deletions, 
suggesting selection against the latter (Ometto et al. 2005). 
Positive selection caused an increased indel replacement rate, 
relative to that in intergenic regions, in accessory gland pro- 
teins of D. pseudoobscura subgroup (Schully and Hellberg 
2006) and, outside Drosophila, in a rodent sperm protein 
Catsperl (Podlaha et al. 2005). Finally, Chen et al. (2009) 
have shown that approximately 10 Mb of the human 
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genome contain multiple indels that were fixed under positive 
selection. 

Any study of selection affecting indels is complicated by 
difficulties in determining the indel mutation rates. These 
rates depend on the indel length and are known only approx- 
imately. According to the data on single-nucleotide and indel 
replacements in non-LTR transposons of D. melanogaster, 
mutation rates for insertions (deletions) are approximately 
0.015-0.05 (0.1-0.2) of that for single-nucleotide substitu- 
tions, depending on the data set (Blumenstiel et al. 2002; 
Petrov 2002). The mean insertion length, 5.6 nt, is less than 
the mean deletion length, 33.0 nt (Blumenstiel et al. 2002), 
but these estimates are susceptible to errors due to long 
indels, which are rare. The parameters of indel evolution, for 
example, mutation rate, insertions to deletions ratio, and 
mean indel size, differ between species. Nevertheless, some 
general rules exist: deletion mutations are always more fre- 
quent than insertion mutations, the genomic prevalence of 
indels declines with length, and this decline is faster for inser- 
tions than for deletions (Lynch 2007; Messer and Arndt 2007). 
This implies a deletion bias, which needs to be compensated 
by opposing forces to prevent the genome from unrestrained 
contraction (Petrov and Hartl 1998; Petrov 2002). 

Here, we take advantage of the whole-genome data set 
on 1 58 D. melanogaster genotypes (Mackay et al. 201 2) and 
data on interspecies divergence to get precise estimates of 
relative mutation rates for indels of different lengths and to 
investigate the action of negative and positive selection on 
short indels. 



Materials and Methods 

Data 

Full-genome alignments of D. sechellia and D. erecta to 
D. melanogaster (dm3, BDGP release 5) were downloaded 
from the UCSC database http://hgdownload.cse.ucsc.edu/ 
goldenPath/dm3/multiz15way/ (Clark et al. 2007; Heger and 
Ponting 2007). Exons, introns, and intergenic regions were 
extracted from the alignments according to the FlyBase anno- 
tation (ver. 5. 1 2) (Crosby et al. 2007). Exons and introns were 
defined according to the canonical splice isoforms; annotated 
UTRs were excluded from the analyses. A nucleotide was 
considered noncoding, that is, intergenic or intronic, if it 
was not present in any known splice isoform. Illumina se- 
quence reads mapped to D. melanogaster genome (Mackay 
et al. 2012, http://www.hgsc.bcm.tmc.edu/projects/dgrp/ 
freeze1_July_2010/lllumina/) were used to identify polymor- 
phisms. Because lower read lengths may complicate indel call- 
ing, we only considered the 1 58 of 1 62 D. melanogaster lines 
that were sequenced in reads of 75 nt or more; lines 
DGRP-427, DGRP-437, DGRP-774, and DGRP-786 sequenced 
with reads of 45 nt were not analyzed. 



Identifying Polymorphism and Divergence in 
D. melanogaster 

The quality of single-nucleotide polymorphism (SNP) and indel 
calling is critical for our results. Indel calling from short reads 
next-generation sequencing data is especially difficult, be- 
cause the presence of polymorphic indels obscures mapping 
of short reads. We have tried three alternative ways to do it. In 
the first approach, we did the SNP/indel calling on a 
per-individual basis using pileup from SAMtools package 
(Li et al. 2009) (v. 0.1.11, http://samtools.sourceforge.net/) 
with the default parameters. In the second approach, after 
the initial run of pileup, we realigned the reads that mapped 
to indel-carrying regions using GATK (McKenna et al. 2010) 
and then did the calling on these realigned reads in a second 
run of pileup. In the third approach, we did multisample SNP 
and indel calling with mpileup (v. 0.1.17) with the default 
parameters on all genotypes at once. In all procedures, indel 
calls were filtered according to the Phred quality score; only 
the calls with the PhredScore> 10 for an alternative variant 
were retained. We found that the first and the second proce- 
dures, which do indel calling on a per-individual basis, led to a 
large number of false negatives, that is, missed indels. In con- 
trast, in the third procedure, which uses information from 
all individuals simultaneously, both the numbers of false pos- 
itives and false negatives were very low (see below). Visual 
inspection using IGV genome browser (Robinson et al. 201 1) 
also revealed the high quality of the resulting alignments. 
Therefore, we used the third procedure for all analyses. 

Alignment segments containing indels polymorphic in 
D. melanogaster were realigned with MUSCLE (Edgar 2004) 
to increase the quality of alignment with D. sechellia and 
D. erecta. Drosophila sechellia and D. erecta were used to 
infer the ancestral state; indels or SNPs were discarded 
when these two outgroups disagreed or if more than 50% 
of D. melanogaster individuals had no data for this genomic 
segment. An indel or a single-nucleotide substitution was con- 
sidered to be fixed in the D. melanogaster lineage after its 
divergence from the D. melanogaster-D. sechellia common 
ancestor if no D. melanogaster line carried the ancestral var- 
iant. We discarded from our analysis regions masked by 
Repeatmasker. We also discarded overlapping indels and non- 
biallelic SNPs and, to further increase our confidence in the 
genomic positions of polymorphisms, required two alignment 
positions without gaps (one from each flank) to be immedi- 
ately adjacent to an indel polymorphism or a SNP. 

We analyzed indels with lengths up to 30 nt. Indels were 
categorized into exonic, intergenic, and intronic. Exonic indels 
were further categorized into nonframeshifting (with lengths 
in multiples of 3) and frameshifting (with lengths not in mul- 
tiples of 3). Intronic indels were categorized into those that 
occurred in long (>300nt) introns, in short (70-300 nt) 
introns, and in very short (<70nt) introns. Indels that over- 
lapped the boundaries of compartments (e.g., an exon-intron 
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boundary) were excluded from analyses. The total numbers 
of polymorphic and fixed indels for each kind of genome 
compartments are shown in supplementary figure S1, 
Supplementary Material online. 

We used two methods to estimate the quality of indel and 
SNP calling. First, we reasoned that because the selection 
against the frameshifting indels and the nonsense mutations 
in conservative genes is the strongest, such mutations are 
expected to be the rarest, and therefore most prone to se- 
quencing and assembly errors. Therefore, we conservatively 
assumed that all such mutations represent sequencing or as- 
sembly errors and estimated the upper threshold for the frac- 
tion of erroneously called mutations in intergenic regions 
under this assumption. For this, we selected the 3,813 most 
conservative genes (with d/V< 0.003 in D. melanogaster line- 
age), each with no less than 500 nonsynonymous sites, and 
used the ratio of the number of indels (nonsense mutations) 
per nucleotide site in these genes to the number of indels 
(mutations to TAA, TAG, or TGA triplets) per nucleotide site 
in intergenic regions to estimate the upper bound for the 
fractions of false-positive indels and single-nucleotide substi- 
tutions. Triplets from intergenic regions were drawn with rel- 
ative frequencies corresponding to codon usage in exons. The 
obtained ratios were 0.048, 0.020, and 0.012 for insertions, 
deletions, and single-nucleotide substitutions, respectively, 
and only approximately 0.007 and approximately 0.008 for 
indels and single-nucleotide substitutions with frequencies 
above 15%, implying that the fraction of erroneous indel 
calls is low. 

Second, we compared the results of indel calling for the 
same line (D. melanogaster line DGRP-859) obtained using 
two sequencing methods: lllumina and 454. Only 1.3% 
(0.7%) of insertions (deletions) called from lllumina data 
were not supported by the data from 454 sequencing 
("false positives"; supplementary tables S1 and S2, Supple- 
mentary Material online). The number of indels observed in 
454-based calling, but not observed in our indel calling ("false 
negatives"), is very low (0.47% for insertions and 0.06% for 
deletions) for indels > 2 nt. For insertions and deletions of 
length 1 nt, it is rather high: 18% and 16%, respectively. 
However, the majority of these 454-only calls (190 of 198 
for insertions, and 208 of 213 for deletions) are located in 
homopolymer tracts of lengths 3 or more, usually consisting 
of A's or T's. Such regions are known to be most prone to 454 
sequencing errors (Huse et al. 2007; Gilles et al. 201 1); there- 
fore, these 454-only calls are most likely to be artifacts of 454 
sequencing. Either way, indels of length 1 nt do not affect our 
estimations of selection acting on indels, as we consider only 
indels of lengths in multiple of 3 nt. 

Inferring Negative Selection 

Negative selection reduces the prevalence of polymorphisms, 
compared with that expected under neutrality. Even negative 



selection of moderate strength can reduce the number of 
high-frequency polymorphisms (with derived allele frequency 
[DAF]>15%); however, only strong negative selection 
(A/ e s<-20, fig. 1 in Messer 2009) can considerably reduce 
the number of low-frequency polymorphisms (DAF< 15%). 
To estimate the fraction of de novo mutations eliminated by 
strong negative selection in a particular genomic compart- 
ment, we compared the prevalence of low-frequency poly- 
morphisms in this compartment and in short introns, where 
selective constraint is the weakest. We performed this analysis 
for indels and single-nucleotide substitutions separately. For 
indels, we performed this comparison for each indel length 
separately; specifically, the indels of lengths 3, 6, and so on in 
short introns were used as a reference for inframe exonic 
indels, and indels of lengths 1, 2, 4, 5, and so on in short 
introns were used as a reference for frameshifting exonic 
indels. To obtain the expected values for the amino 
acid-changing or stop-codon-inducing single-nucleotide sub- 
stitutions in exons, we randomly selected nucleotide triplets 
from short introns in proportion to the frequencies of codons 
within the coding regions and used the prevalence of the 
corresponding mutations in this set of triplets as a reference. 

Negative selection acting on segregating mutations can be 
inferred from allele frequency spectra. We devise the follow- 
ing statistic to measure the deviation of the mutation frequen- 
cies from that expected under neutrality: 



Here, P is the number of polymorphic sites in the corre- 
sponding frequency class; subscripts S and N correspond to 
the sites designated as selectively important and neutral; and 
subscripts H and L correspond to two nonoverlapping intervals 
of allele frequencies: a higher one and a lower one, respec- 
tively. If the allele distribution of the selectively important poly- 
morphisms is the same as that of the neutral polymorphisms, 
implying neutrality, E, is equal to 1 . £ < 1 indicates that the 
frequency of selectively important alleles is lower than ex- 
pected, implying negative selection. For example, ^ = 0.3 
implies that, among the selectively important alleles found in 
the low (L-) interval of allelic frequencies that would have 
reached the high (H-) interval in the absence of selection, 
70% have in fact not reached it because of negative selection. 
If the fraction of deleterious and advantageous mutations is 
negligible in the high-frequency interval and selection is neg- 
ligible in the low-frequency interval, 1 - £, can be thought of 
as the fraction of mutations prevented from fixation by weak 
or moderate negative selection. 

We calculated £(L,H) for indels and for single-nucleotide 
substitutions. To facilitate the comparisons of indels in 
coding and noncoding compartments, only indels of lengths 
in multiple of 3 were considered here. Because there is no 
well-established class of neutral insertions or deletions that 
could serve for calibration of the observed indel frequencies, 
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we used SNPs and replacements within positions 8-30 of in- 
trons with lengths up to 65 nt as a measure of P N both for the 
single-nucleotide substitutions and for the indels, following 
Halligan and Keightley (2006), Haddrill et al. (2007), and 
Parsch et al. (2010). We use DAF< 15% as the L interval 
and DAF>15% as the H interval. The 15% frequency 
cutoff was recommended for reducing the effect of slightly 
deleterious mutations in the McDonald-Kreitman test 
(Charlesworth and Eyre-Walker 2008) and is also appropriate 
here, as very few weakly deleterious mutations reach this DAF, 
whereas a sufficiently large fraction of neutral mutations falls 
into the high-frequency class. 

Currently, our simple method for inference of negative se- 
lection from allele frequencies seems preferable to Poisson 
Random Field-based methods (Hartl et al. 1994) or methods 
for estimation of fitness effects introduced by Keightley 
and Eyre-Walker (2007) in analyses involving indels. Indeed, 
those methods employ information on the mutation rate, 
whereas the indel mutation rate is known only approximately 
and is strongly dependent on the indel length. Furthermore, 
£ compares the distribution of allele frequencies in the se- 
lected sites to the spectra observed in the neutral sites, 
rather than to some theoretically expected distribution, and 
thus controls for the demographic effects, which have other- 
wise to be assessed separately (Keightley and Eyre-Walker 
2007). 



Inferring Positive Selection 

To infer the fraction of indels that were fixed under positive 
selection, we used a modification of the McDonald-Kreitman 
test (McDonald and Kreitman 1991; Smith and Eyre-Walker 
2002; Podlaha et al. 2005). To reduce the effect of weakly 
deleterious mutations on the estimates, we used only poly- 
morphisms with DAF> 15% (Charlesworth and Eyre-Walker 
2008). Therefore, the fraction of replacements that were 



driven to fixation by positive selection was calculated as 
follows: 



Here, D is the number of mutations fixed between species; 
in calculation of D, we excluded the nucleotide sites or indels 
that were polymorphic in D. melanogaster. 

Results 

Using the Prevalence of Polymorphic Indels to Estimate 
Their Relative Mutation Rates 

Data on polymorphisms that segregate at very low frequencies 
can be used to estimate the relative prevalence of mutations 
of different types, as long as the impacts of selection or biased 
gene conversion (BGC) at these frequencies are negligible 
compared with that of drift (Messer 2009). As very 
low-frequency polymorphisms, we use polymorphisms such 
that the derived allele is observed in \-A genotypes in our 
sample of 158 D. melanogaster individuals, that is, with 
DAF<3%. Short introns are less constrained by selection 
than longer introns or other genomic regions (Parsch 2003; 
Halligan and Keightley 2006; Haddrill et al. 2007; Parsch et al. 
201 0). Therefore, we first used introns with lengths 70-300 nt 
to estimate the relative mutation rates. Among the very 
low-frequency polymorphisms in such introns (fig. 1/\), the 
number of insertions (deletions) is 0.044 ±0.01 2 (0.100± 
0.021; throughout this article, the ranges correspond to 
95% confidence intervals estimated from 1,000 bootstrap 
trials) of that of single-nucleotide substitutions, the latter 
being approximately 1CT 8 per nucleotide per generation 
(Haag-Liautard et al. 2007). The average lengths of very 
low-frequency insertions and deletion found in short introns 
are 2.88 (2.30-3.37) and 4.1 1 (3.25-4.88) nt, respectively. 

However, these estimates of the indel mutation rates could 
be distorted, because a sufficiently long deletion in a short 
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Fig. 1. — Prevalence of insertions and deletions of different lengths per single-nucleotide substitution observed at very low derived allele frequencies 
(DAF < 3%) in introns of lengths 70-300 nt (A) and in intergenic regions (B). Error bars are 95% confidence intervals based on 1,000 bootstrap trials. 
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intron is likely to overlap its boundary or a splice site and also 
because very short introns are selected against (Parsch 2003). 
The second least-constrained class of contiguous genome seg- 
ments in D. melanogaster is the intergenic regions (Andolfatto 
2005), where the per site number of very low-frequency SNPs 
(0.014) is close to that in short introns (0.013). Therefore, we 
also estimated the indel mutation rates in the intergenic re- 
gions. Here, the numbers of very low-frequency insertions and 
deletions are, respectively, 0.037 ±0.002 and 0.095 ±0.003 
of that of single-nucleotide substitutions, and their respective 
lengths are 3.23 (3.04-3.43) and 4.78 (4.58-4.96) nt (fig. 1 B). 
A mean deletion is longer in the intergenic regions than in 
short introns, consistent with selection against longer dele- 
tions in short introns. 

Overall, the data on intergenic regions imply that, in the 
absence of selection, insertions of lengths 1-30 nt would 
lengthen the D. melanogaster genome by 0.12±0.01nt, 
whereas deletions of these lengths would shorten it by 
0.42 ± 0.04 nt, per each de novo single-nucleotide substitu- 
tion; together, these effects would result in a contraction of 



genome by 0.30 ± 0.05 nt. Even in the least constrained ge- 
nomic regions, selection against insertions and, in particular, 
deletions is stronger than against single-nucleotide substitu- 
tions (see later); therefore, all these values may be 
underestimates. 

Negative Selection on Indels and Single-Nucleotide 
Replacements in Different Genome Compartments 

Negative selection reduces the number of polymorphic sites, 
but only strong negative selection (A/ e s < -20) can substan- 
tially reduce the number of sites carrying low-frequency dele- 
terious alleles (fig. 1 in Messer 2009). We can estimate strong 
selection by comparing the prevalences of low-frequency poly- 
morphisms in different genomic compartments that likely 
experience different degrees of constraint. Prevalence of 
low-frequency (DAF<15%) indels is very similar between 
the intergenic regions and introns, including short introns, sug- 
gesting that the majority of new arising mutations at these 
compartments are at most weakly deleterious (fig. 2B-D, 
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Fig. 2. — Numbers of polymorphic and fixed indels and single-nucleotide substitutions in genome compartments of different kinds. Top row (A-D), 
polymorphisms with DAF < 1 5%; middle row (E-H), polymorphisms with DAF > 1 5%; and bottom row (/-/_), fixed mutations. A, E f I: exons (inframe indels 
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light gray and dark gray bars). Similarly, there is no reduction in 
the prevalence of the SNPs (fig. 2B-D, blue bars). 

In contrast, in exons, the prevalence of low-frequency 
indels is notably reduced (fig. 2/4). The prevalence of inframe 
insertions (deletions) in exons is approximately 0.45 (~0.37) 
of that in the reference compartment. The prevalence of 
low-frequency amino acid-changing substitutions is also sub- 
stantially reduced, to approximately 0.37 of that expected 
from the rate of substitutions in the reference compartment. 
The deepest reduction was observed for the loss-of-function 
mutations: The prevalence of low-frequency frameshift inser- 
tions (deletions) is only approximately 0.08 (~0.05) of that in 
the reference compartment. This is similar to the reduction 
in the prevalence of the low-frequency nonsense single- 
nucleotide substitutions, which is approximately 0.04 of the 
prevalence of the analogous types of substitutions (i.e., those 
giving rise to TAA, TGA, or TAG triplets) in the reference com- 
partment (supplementary fig. S2, Supplementary Material 
online). Both for the inframe and frameshifting indels, the 
reduction in their prevalence is independent of indel length, 
at least for lengths up to approximately 20 nt (supplementary 
fig. S3, Supplementary Material online). 

Weak negative selection (-5< A/ e s< -1), although inef- 
fective at low allele frequencies, considerably reduces the 
prevalence of high-frequency polymorphisms and sites of in- 
terspecific divergence. Indeed, although intergenic regions 
and introns of different lengths have similar prevalences of 
low-frequency polymorphisms, the pattern observed for the 
high-frequency polymorphisms and sites of divergence is rad- 
ically different. The prevalences of high-frequency and fixed 
indels and single-nucleotide substitutions are substantially 
higher within the short introns than within other genome 
compartments (fig. 2E-L), suggestive of weak negative selec- 
tion in the latter. The estimates of negative selection based on 



divergence are confounded by positive selection (see later); 
therefore, to infer weak negative selection, we used polymor- 
phism data alone. 

As long as multiple mutations at a site are rare, allele fre- 
quency spectra are independent of the mutation rates, which 
enables us to compare mutations of different types with each 
other. In line with Parsch et al. (2010), we use SNPs at posi- 
tions 8-30 of introns <65nt as the neutral standard. The 
fraction of slightly deleterious mutations can be estimated 
from the deficit of high-frequency alleles, compared with 
the value expected from the number of low-frequency alleles. 
We use the ratio of the high-frequency and low-frequency 
polymorphisms for the putatively selected mutations, divided 
by this same ratio for the neutral SNPs, as the upper bound on 
the fraction of neutral mutations in the low-frequency interval, 
and 1 - £ as the lower bound on the fraction of at least mod- 
erately deleterious mutations in the low-frequency interval 
(see Materials and Methods for details). 

Comparisons of £ for different types of mutations show 
that, within each genomic compartment, indels are more del- 
eterious than single-nucleotide substitutions, and, among 
indels, deletions are more deleterious than insertions 
(fig. 3/4). In exons, at least approximately 82% of insertions 
and approximately 87% of deletions are deleterious, com- 
pared with approximately 73% for segregating missense sub- 
stitutions; all these values are higher than those for any class of 
the noncoding regions. Outside exons, the pattern of negative 
selection is very similar in intergenic regions and in long in- 
trons: The fraction of deleterious mutations among insertions 
(~71%) is lower than among deletions (~80%) but much 
higher than among single-nucleotide substitutions (~49%) 
(fig. 3/4). Again, selection affecting all types of mutations is 
the weakest in the short (70-300 nt) introns (fig. 3/4); how- 
ever, even within this compartment, indels are much more 
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Fig. 3. — £ for indels and single-nucleotide substitutions. Low values of £ correspond to a high fraction of deleterious mutations and vice versa. {A) Mean 
5 in different genome compartments: exons (inframe indels or missense substitutions), intergenic regions, long introns (>300nt), and short introns 
(70-300 nt). (B) £ for indel mutations in short and very short introns, depending on the length of the intron after the indel. Light gray, insertions; dark 
gray, deletions; yellow, missense substitutions; and blue, single-nucleotide substitutions in noncoding regions. Error bars are 95% confidence intervals based 
on 1,000 bootstrap trials. 
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likely to be deleterious (~47% of insertions and ~65% of 
deletions) than single-nucleotide substitutions (~15%). 

Very short introns with lengths below 60 nt warrant a spe- 
cial consideration (Parsch 2003): The boundaries on the min- 
imal intron length lead to extra selection against deletions in 
this length class. Indeed, the dependence of selection against 
deletions on intron length is nonmonotonic: The fraction of 
deleterious deletions is higher in the long and in the very short 
introns than in the introns of intermediate lengths. It is the 
highest (~81 %) for deletions that give rise to introns shorter 
than 50 nt. In contrast, the fraction of deleterious insertions is 
the lowest (~10%) in this category of introns (fig. 3B). 

Positive Selection on Indels and Single-Nucleotide 
Replacements 

Figure 4 shows that positive selection on indels is ubiquitous in 
the D. melanogaster genome. The fraction a of insertions that 
were fixed by positive selection between species is very high 
within all compartments: It reaches approximately 60% in 
exons and approximately 67% in noncoding regions. For de- 
letions, a is approximately 45% in exons but only 32-35% in 
noncoding regions. Estimates of oc obtained for single- 
nucleotide substitutions are much lower than for indels: 
approximately 29% for missense substitutions and approxi- 
mately 0% for single-nucleotide substitutions within noncod- 
ing regions. 

The estimates of oc, especially those for single-nucleotide 
substitutions, range widely between the published analyses 
(Fay et al. 2002; Smith and Eyre-Walker 2002; Sawyer et al. 
2003; Bierne and Eyre-Walker 2004; Andolfatto 2005). Our 
estimates for the amino acid-changing substitutions (29%) 
and single-nucleotide substitutions in the noncoding regions 




-0.2 



exons mtergenic 
(inframe regions 
indels) 



long short 
introns introns 70 
to 300nt 



Fig. 4. — a for indels and single-nucleotide substitutions in the differ- 
ent genome compartments: exons (inframe indels or missense substitu- 
tions), intergenic regions, long introns (>300nt), and short introns (70- 
300 nt). Light gray, insertions; dark gray, deletions; yellow, missense sub- 
stitutions; and blue, single-nucleotide substitutions within the noncoding 
regions. Error bars are 95% confidence intervals based on 1 ,000 bootstrap 
trials. 



(0%) are near the low end of this range. We investigated 
these differences, comparing our approach with that used in 
another recent analysis (Parsch et al. 2010), and found that 
they are due to several causes: 1) different data sets used 
(a whole-genome data set in our analysis and a set of 
119 genes in Parsch et al. 2010) and 2) different methods 
for estimating divergence (only substitutions along the D. mel- 
anogaster lineage in our analysis, and all substitutions 
between D. melanogaster and D. sechellia in Parsch et al. 
2010) (table 1). Calculating divergence as the number of dif- 
ferences between two species is the approach used in the 
original test of McDonald and Kreitman (1991) and still used 
conventionally in most (Andolfatto 2005; Begun et al. 2007; 
Shapiro et al. 2007; Parsch et al. 2010), although not all 
(Mackay et al. 2012) studies. The difference between the 
two approaches may arise, for example, due to the sites poly- 
morphic in non-melanogaster lineage contributing to diver- 
gence, which may bias estimates of a (Keightley and Eyre- 
Walker 2012). 

Estimates of a also heavily depend on the neutral standard 
used. Whole-genome data analysis indicates considerably 
lower values of oc when positions 8-30 in introns <65nt 
were used as a neutral standard rather than synonymous 
sites (table 1). The difference probably arises due to negative 
selection favoring C and G in the third positions of codons 
(Vicario et al. 2007; Zeng and Charlesworth 2009 and fig. 5), 
which biases synonymous sites-based estimates of oc upward. 
Although this selection is weak (A/ e s < - 1 ), it reduces the over- 
all divergence at synonymous sites relative to positions 8-30 
within introns <65 nt by a factor of approximately 1 .3 (fig. 5). 
In our analysis, use of short introns instead of synonymous 
sites leads to reduction of oc for nonsynonymous replacements 
from 39% to 20% in the autosomal regions and from 56% to 
40% on the X chromosome (table 1). The difference was 
much more moderate in Parsch et al. (2010) and also was 
not observed in reanalysis of our data for the 119 genes 
that they had used (table 1), suggesting that these genes 
may represent a biased subset of the genome in this respect. 
Using short introns, rather than synonymous sites, as a neutral 
standard also drastically changes the estimates of oc for the 
noncoding regions (table 2): We see no evidence of positive 
selection (oc ~ 0%) in the intergenic regions, contrary to some 
previous evidence (Andolfatto 2005). The lack of evidence of 
positive selection in the noncoding regions was also indicated 
in Shapiro etal. (2007). 

Discussion 

Data on within-population polymorphism can be used for 
measuring the relative rates of mutations of different types. 
For this approach to be valid, it is essential to consider 
only those genomic segments where the role of selection is 
minimal. Also, it is preferable to take into account only 
low-frequency polymorphisms, which are least affected by 
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Table 1 

Comparison of Different Methods for Calculation of a (%) for Nonsynonymous Substitutions 



Divergent Sites 


Neutral Standard 




Our Calculations 




Parsch Calculations 






Whole-Genome Data 


Parsch et al. (2010) 


Parsch et al. (2010) 










Gene Subset 


Gene Subset 






Autosome 


X-Linked 


Autosome 


X-Linked 


Autosome 


X-Linked 


Dmel 3 


Synonymous (4fold) sites 


39.0 (37.8 to 40.2) 


56.1 (53.8 to 58.3) 


43.7 (20.5 to 62.7) 


73.4 (65.5 to 79.7) 






Dmel a 


8-30 in intron <65nt 


19.6 (16.7 to 22.1) 


40.4 (34.0 to 46.4) 


42.2 (-7.7 to 69.2) 


74.1 (50.5 to 85.3) 






Dmel-Dsec b 


Synonymous (4fold) sites 


49.4 (48.6 to 50.2) 


66.9 (65.4 to 68.3) 


48.8 (30.3 to 63.9) 


81.7 (76.8 to 85.8) 


49.0 


74.3 


Dmel-Dsec b 


8-30 in intron <65nt 


45.2 (43.8 to 46.7) 


63.1 (59.6 to 66.3) 


52.5 (27.0 to 69.0) 


80.7 (64.3 to 88.1) 


43.7 


79.7 



Note. — The method used in our analyses is marked in bold. Confidence intervals based on 1,000 bootstrap trials are shown in parentheses 

a Only differences fixed in the Drosophila melanogaster (Dmel) lineage are used. 

b AII differences fixed between D. melanogaster (Dmel) and D. sechellia (Dsec) are used. 



A Polymorhic, DAF<15% 




A(T)->G{C) G{C)->A(T) A(T)->T(A) A(T)->C{G) C(G)->A(T) C{G)->G(C) 



Fig. 5. — Negative selection on synonymous sites. Panels A-C show the numbers of single-nucleotide substitutions per nucleotide site. Green bars 
correspond to mutations within 8-30 nt in introns <65 nt, and blue bars correspond to mutations within the 4-fold-degenerate sites. (A) polymorphisms with 
DAF<15%; (B) polymorphisms with DAF>15%; and (Q fixed mutations. In panel D, each bar is the ratio of the per-site numbers of single-nucleotide 
substitutions observed within the 4-fold-degenerate sites and in positions 8-30 of short introns. Error bars are 95% confidence intervals based on 1,000 
bootstrap trials. 
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Table 2 

Comparison of Different Methods for Calculation of a (%) for Substitutions in Intergenic Regions 

Divergent Sites Neutral Standard Autosome X-Linked 

Dmel a Synonymous (4-fold) sites 24.6 (23.6 to 25.4) 31.5 (29.4 to 33.3) 

Dmel a 8-30 in intron <65 0.5 (-2.4 to 3.5) 6.8 (-4.1 to 15.8) 

Dmel-Dsec b Synonymous (4-fold) sites 21.2 (20.5 to 22.0) 27.6 (25.7 to 29.2) 

Dmel-Dsec b 8-30 in intron <65 14.6 (12.7 to 16.5) 19.1 (12.1 to 25.0) 

Note. — The method used in our analyses is marked in bold. Confidence intervals based on 1,000 bootstrap trials are shown in parentheses. 

a Only differences fixed in the Drosophila melanogaster (Dmel) lineage are used. 

b AII differences fixed between D. melanogaster (Dmel) and D. sechellia (Dsec) are used. 



selection. Our analysis of very rare (DAF<3%) polymorphic 
indels in the less conserved regions of the D. melanogaster 
genome follows these requirements. Using short introns and 
intergenic regions for inferring the indel mutation rates pro- 
duces similar results, further validating this approach. Our es- 
timates for the relative rates of insertions (0.035-0.039 per 
single-nucleotide substitution) and deletions (0.092-0.098) 
are similar to those obtained by others (0.015-0.05 and 
0.1-0.2, respectively) (Blumenstiel et al. 2002; Petrov 2002). 
Our estimates of the mean insertion (3.23) and deletion (4.78) 
lengths are lower than those obtained previously (Blumenstiel 
et al. 2002), mainly because we only analyze indels with the 
lengths up to 30. Even within our range of lengths, longer 
deletions are underrepresented in short introns (fig. 1), which 
is consistent with strong selection against very short introns 
and against indels affecting splice sites. 

Our results, as well as those of others, show that insertions 
and deletions make a large contribution to the mutational 
landscape: The numbers of substituted, inserted, and deleted 
nucleotides per generation are all of the same order of mag- 
nitude. Moreover, we observed a strong deletion bias among 
the low-frequency polymorphisms of D. melanogaster, consis- 
tent with previous observations of deletion bias for fixed indels 
in unconstrained regions (Petrov et al. 1 996). In the absence of 
opposing forces, this bias would lead to a rapid genome con- 
traction. However, selection reduces this bias considerably, 
because negative selection eliminates deletions faster than 
insertions (fig. 3; Comeron and Kreitman 2000; Parsch 
2003; Ometto et al. 2005), whereas positive selection favors 
deletions more than insertions (fig. 4). 

Selection acting on indels and single-nucleotide substitu- 
tions can be either weak or strong; weak selection substan- 
tially reduces the prevalence of polymorphisms only at high 
allelic frequencies, whereas the signature of strong selection 
can be detected even at low allelic frequencies. Selection is 
strong enough to reduce the numbers of polymorphisms only 
in the regions with the highest selective constraints. In exons, 
even the inframe indels are strongly deleterious. Selection 
against inframe indels is similar in magnitude to selection 
against missense single-nucleotide substitutions (fig. 2A) and 
is independent of the indel length for the lengths at least up 
to approximately 7 codons (supplementary fig. S3, 



Supplementary Material online). The lack of dependence of 
selection on the indel length is somewhat unexpected, sug- 
gesting that indels of lengths between 1 and 7 codons have 
similar chances to disrupt the gene function, probably because 
a typical functional segment of a protein is much longer than 
that. Similarly, frameshifting indels are about as deleterious as 
nonsense mutations. The frequency of frameshifting indels 
suggests that they are responsible for a large fraction of 
gene knock-outs, because the expected number of frame- 
shifting indels is about the same as the number of nonsense 
mutations (supplementary fig. S2, Supplementary Material 
online). 

The high prevalence of low-frequency indels and SNPs in 
introns and intergenic regions and the lack of difference be- 
tween these classes of genome compartments (fig. 2, upper 
row) suggest that strong selection affects them only rarely, if 
at all. However, such mutations are still subject to weaker 
selection, which prevents them from reaching high frequen- 
cies. In fact, the majority of mutations that we observed are 
weakly deleterious (fig. 3). 

Within each genome compartment, weak selection affects 
a larger fraction of indels than of SNPs, and, among indels, a 
larger fraction of deletions than of insertions. Therefore, 
throughout the genome, the mutational deletion bias is lar- 
gely compensated by stronger selection against deletions. As a 
result, at high DAFs, the genome-wide deletion bias is reduced 
to 0.08 ±0.04 deleted nucleotides per neutral single- 
nucleotide substitution, compared with 0.30 ±0.05 for de 
novo mutations. 

The remaining deletion bias is further compensated by se- 
lection at the level of interspecific divergence: In all genomic 
compartments, the insertions are more often advantageous 
than deletions. Insertion-biased gene conversion (iBGC) may 
also play a role in the fact that insertions segregate at higher 
frequencies and in the increased probability of fixation of in- 
sertions. However, gene conversion in D. melanogaster is 
significantly biased toward insertions only for very short 
indels (<5 nt) and only in highly recombining regions 
(Leushkin and Bazykin in review). The contribution of iBGC 
to the evolution of the genome size is minor: Because of 
this process, the overall gain is only approximately 0.02 nt 
per each neutral single-nucleotide replacement (Leushkin 
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and Bazykin in review). The joint contribution of iBGC and 
selection lead to a further decrease in the deletion bias ob- 
served at the interspecies level: For fixed mutations, it drops to 
0.02 ± 0.02 per neutral single-nucleotide substitution in inter- 
genic regions and to 0.01 ±0.01 in exons. 

In summary, the strong deletion bias observed among the 
de novo mutations has little effect on the evolution of the 
genome size in the D. melanogaster lineage. The main force 
that seems to be opposing it is weak selection favoring inser- 
tions. Indeed, comparison of different Drosophila species in- 
dicates that the amount of intronic and exonic DNA is largely 
conserved and that the variation in the overall genome size is 
primarily due to the differences in the amount of transposable 
DNA (Clark et al. 2007). 

Our data also reveal weak selection affecting single- 
nucleotide substitutions at synonymous sites of the coding 
exons. At low allele frequencies, the relative rates of all 
types of single-nucleotide substitutions at synonymous sites 
are similar to those in short introns; however, these rates 
become substantially biased at higher allele frequencies and 
especially among substitutions that get fixed between species 
(fig. 5). This difference is caused by selection against the 
W substitutions, which maintains the codon usage bias 
in D. melanogaster (Vicario et al. 2007; Hershberg and Petrov 
2008; Zeng and Charlesworth 2009). Negative selection 
acting on synonymous substitutions suggests that using 
them as a neutral standard can substantially bias the estimates 
of positive and negative selection. 
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